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Abstract —We introduce a ’’one shot” random access procedure 
where users can send a message without a priori synchronizing 
with the network. In this procedure a common overloaded control 
channel is used to jointly detect sparse user activity and sparse 
channel profiles. The detected information is subsequently used 
to demodulate the data in dedicated frequency slots. We analyze 
the system theoretically and provide a link between achievable 
rates and standard compressing sensing estimates in terms of 
explicit expressions and scaling laws. Finally, we support our 
findings with simulations in an LTE-A-llke setting allowing ’’one 
shot” sparse random access of 100 users in 1ms. 

I. Introduction 

Sporadic traffic generating devices, e.g. machine-type com¬ 
munication (MTC), are most of the time inactive but regu¬ 
larly access the Internet for minor/incremental updates with 
no human interaction IT]. Sporadic traffic will dramatically 
increase in the 5G market and, obviously, cannot be handled 
with the bulky 4G random access procedures. Two major 
challenges must be addressed: 1) unprecedent number of 
devices asynchroneously access the network over a limited 
resource and 2) the same resource carries control signalling 
and payload. Dimensioning the control channels according to 
classical theory results in a severe waste of resources which, 
even worse, does not scale towards the requirements of the 
loT. On the other hand, since typically user activity, channel 
profiles and message sizes are compressible within a very 
large receive space, sparse signal processing methodology is 
a natural framework to tackle the sporadic traffic. 

Preliminary Work. The key findings of sparse signal 
processing are that in an under-determined system undergoing 
noise the signal components can be indeed identified if 1) 
the measurements are suitably composed and 2) the signal 
space is sparse (or more generally ’’structured”), i.e. only a 
limited number of elements in some given basis are non-zero. 
It has then soon been recognized that this can be exploited for 
multiple access with sparse user activity (see m for a recent 
overview). This was extended to asynchronous fading channels 
El as well as asynchronous multipath block fading channels 
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with known/unknown (sparse) multipath channel ||4l. Notably, 
all these concepts are fundamentally different from a classical 
’’overloaded” CDMA channel without sparsity 0 where user 
activity and data detection are separate steps. 

Recently, massive MTC random access for 5G has fuelled 
the topic, particularly within the EU projects METIS and 
5GNOW Q, El, |l6l. Ref. Q investigates the interaction of 
advanced multiuser detection and a random access scheme 
called coded slotted ALOHA. However, the effect of channel 
estimation and data detection errors (which is necessary due to 
broadband) and error propagation in the intereference cancel¬ 
lation scheme is crucial 13 and must be carefully considered 
in follow-up work. In E) new coding schemes and limits 
for MTC with random arrivals in a slotted ALOHA (using 
either time or frequency slots) have been presented. While 
narrowband in nature and without advanced sparsity promoting 
multiuser detection, it is argued that for such scenarios the ef¬ 
fect of channel estimation errors becomes negligible (however, 
implicitely, by using arbitrary long preambles!). Consequently, 
recent concepts deal either with data or channel estimation and 
an overall architecture which includes identification, channel 
estimation, asynchronicity and data detection in ’’one shot” is 
an open topic. 

Contributions. In this paper, we propose a ALOHA 
(O)EDMA protocol similar to El where users select fre¬ 
quency slots of flexible size without coordination. In such 
asynchronous scenario, it becomes very inefficient to reserve 
control ressource for every slot since delay spread is large, 
i.e. coherence bandwidth is small. We suggest an overloaded 
common control channel which is accessed by all active users 
at the same time, see Eig. [T] Sparse signal processing will 
be used for joint (sparse) user activity detection and (sparse) 
channel estimation. One control channel concept is that data 
and control channels are superimposed so that control is spread 
over the whole signal space and collected back within some 
(small) observation window ID. Another concept which is 
followed here is to fully separate control and data which 
requires a careful investigation of actual payload vs. control 
signaling ratio. In the following sections the details of such 
sparsity aware random access scheme are outlined, analyzed 
and simulated. 

Notations. ||x||^, = (Ei is the usual notion of 

fq-norms and Ikll := Wxh.,, denote with supp(a:) := {i : 
Xi '.= (cijx) 0} the support of x in a given fixed (here 




Fig. 1. Random access concepts: a) standai'd procedure b) 5GNOW overlayed control channel El. 0 c) 5GNOW common control channel (in this paper) 


canonical) basis The size of its support is denoted 

as |la:||f(, := |supp(a;)|. W is the (unitary) Fourier matrix with 
elements [W)ki = for k,l = 0 .. .n—1, hence, 

W~^ = W* where W* is the adjoint of W. We use here also 
X = Wx to denote Fourier transforms and © means point- 
wise product. In is the identity matrix in C", diag(x) is some 
arbitrary diagonal matrix with x G C” on its diagonal. 

II. Compressive random access 
A. "One-shot” transmitter 

Let pu G C" be an pilot (preamble) sequence from a given 
(random) set V C C" and Xu G C" be an unknown (uncoded) 
data sequence Xu G T”” C C" both for the u-th user with 
u G {!,...,[/} and C7 is the (fixed) maximum set of users in 
the systems. Note that in our system n is a very large number, 
e.g. 24k. Due to the random zero-mean nature of Xu we have 
^E\\pu-\- XuW^ = 1, i.e. the total (normalized) transmit power 
is unity. Provided user u is active we set; 

a:=-|bu|P and a':= 1 - a =-EWxuW^ 
n n 

Hence, the control signalling fraction of the power is a. If a 

user is not active then we set both Pu = Xu = 0, i.e. either a 

user is active and seeks to transmit data or it is inactive. 

We will use a cyclic model, which is achieved with OFDM- 
like signaling and the use of an appropriate cyclic prefix and 
restrict our model here to time-invariant channels. Each vector 
hn G denotes the sampled channel impulse response 

(CIR) where Tcp is the length of the cyclic prefix. We assume 
to have a priori support knowledge on each hu- (i) bounded 
support, i.e. supp(/i„) C [0,...,Tcp — 1] due to the cyclic 
prefix and (ii) sparsity, i.e. ||/iu||io < ki. Eventually, we 
assume that only /c 2 users out of U in total are actually active. 
Define k := kik 2 . 

In an OEDM system the EET size n is then chosen as n ^ 
Tcp. Let [/i, 0] G C" denote the zero-padded CIR. With these 
assumptions the received signal is then: 

u-i 

y = ^ circ([/i,0])(pu Xu) + e 

u—0 

VB = 


Here, circ([/i, 0]) G C" denotes the circulant matrix with \h, 0] 
in its first column; denotes the overall measurement matrix 
(to be specified later on) referring to a ’’frequency observation 
window” B. The number of subcarriers in S is to := \B\. 
The AWGN is denoted as e G C" with E{ee*) = a^In- Eor 
circular convolutions we have circ([/i, 0])p = ^/n ■ W* {h 0 p) 
so that; 

u 

y = '£w* 

U — 1 

VB = ^BV 

where e and e are statistically equivalent. 

Eor the users’ data the entire bandwidth B^ is dived into B 
frequency slots. A standard assumption is that users’ arrivals 
are modeled as an Poisson process with rate A which is 
even true if retransmissions are incorporated E). Each user 
selects a slot in a ALOHA (O)EDMA fashion. Clearly, by 
the single interval contention period the users will choose 
the same EDMA slot with some probabilty, called outage 
event. Obviously, for a fixed rate requirement throughput 
maximization means outage probability minimization. Define 
the ordered user rates as i?i, i? 2 , Ru- Then, it is shown in 
il that the average throughput (for any user k) is given by: 

T (A,7^) = Aexp 

with a rate constraint TZ. In the following we assume that the 
probabilties Pr(i?fe > TV) are mainly dependent on the receive 
powers (user position, slow fading effects) while the fast 
fading effects are averaged out due to coding over subcarriers. 
Hence, user rates are ergodic and are calculated as expectations 
over the fading distributions. The relevant expressions under 
erroneous channel estimation will be provided in this paper. 

B. Receiver operations 

All performance indicators depend on the number of sub¬ 
carriers in B (control) and B’^ (data). The goal is the limitation 
to a small observation window B. Let Pg : C" —C"* be the 
corresponding projection matrix, i.e. the submatrix of with 
rows in B. Eor identifying which preamble is in the system we 



y Py {Rk > TZ) ■ TZ 




































can consider y and use the frequencies in B, i.e. = PbW, 
so that: 


UB ■= Pb 


u 


\ynhu 0 


^u) 


Pbb 


Notably in 0 we have introduced randomized pointwise 
multipliers ^ G C" in time domain instead, denoted by the 
corresponding nxn diagonal matrix := diag(^), which is 
favorable for the sparse recovery. Hence, the m x n sampling 
matrix = PbWM^ is considered which, however, comes 
at the cost of performance loss due to superimposed pilot 
subcarriers. 

For algorithmic solution, we can stack the users as: 


probabilities for (i) and (ii). Refined estimates which include 
(iii) and (iv) require more assumptions on the sampling model, 
noise distribution and recovery procedure and will therefore 
appear in a separate work. 

A. RIP and performance guarantees 

Let Efe := {x G C" : ||a;||£o < k} denotes the fc-sparse 
vectors. A matrix <1) is called fc-RlP if exists 0 < 4 < 1 
such that |||$a;||^ — ||a;|p| < 4||a;|P for all x G Efc. There 
is a well-known result HD on BPDN and we call this as the 
Qi-estimator for x given y, i.e. x = Qi{y): If <1 >b is 2fc-RlP 
with 4fe < \/2 — 1 and I|e||r2 < e then: 


u 

2/ = ^ civc{hu)ipu +Xu) + e 

U — 1 

= D{p)h + C{h)x + e 

where D{p) := [circ(pi),... ,circ(p( 7 )] G ^.'n'xUn 
C{h) := [circ([hi, 0]),... ,circ([/i( 7 ,0])] G are 

the corresponding compound matrices, respectively p = 
[pT P 2 ■■■PuV' ^ ^2 corresponding 

compound vectors. If we assume each user-channel vector /i„ 
to be /ci-sparse and k 2 are active then h is fc-sparse. 

For joint user activity detection and channel estimation 
exploiting the sparsity we can use the standard basis pursuit 
denoising (BPDN) approach: 

argmin||h||£i s.t. \\^b D{p)h - yWn^ < e (1) 

h 

Moreover, several greedy methods exists for sparse recon¬ 
struction. In particular, for CoSAMP ifTOl explicit guarantees 
in reconstruction performance are known and can be used 
instead of BPDN. After running the algorithm in eqn. ([Til the 
decision variables Vu, are formed, indicating that if 

11411^2 > ^ where ^ > 0 is some predefined threshold the user 
is considered active and its corresponding data is detected. 
In 0 the correponding pilot signal is subtracted from the 
recieved signal by interference cancellation. Here, we assume 
full separation of data and control so that supp(pii) C B Vu. 
Denote the error of this operation as d := h — h. Hence, the 
received signal is given by: 

y = {y/nh + d) Q X + e (2) 

which a set of parallel channels each with power E{\xk\'^) = 
1 — a, IpP = a and E{\ek\'^) = tr^. 

HI. Performance analysis 

It is possible to find the scaling of rates if one makes the 
following assumptions: (i) all users employ independent Gaus¬ 
sian codebooks (ii) if a user is not detected the corresponding 
data is discarded (iii) if restricted isometry property (RIP) 
is not satisfied (see the discussion in following subsection), 
then the data of all is fully discarded (iv) if the actual noise 
vector is larger than the estimated noise vector then the data 
of all is discarded. In our analysis we will work with outage 


||(3i($a: -I- e) - x\\e^ < cie 

with Cl = 4-v/l + 4fc/l —(l + -\/2)4fc (in particular, for 4fc = 
.2 this gives ci = 8.5). It is known that 4fe 0 V2-1 is 
a necessary condition. In ifT^ the bound has been improved 
to 4fc 0 3/(4 -(- a/G) ~ 0.4652. Similar bound exists for 

CoSAMP mni, i.e. 4fe < Y^2/5-f r; 0.3843 US. 

Up this point we have mentioned uniform reconstruction 
guarantees (for any x) for a given matrix $ and these are 
related to its RJP-constant 4fe- L is still difficult to construc¬ 
tively design measurements matrices with sufficiently small 
RIP constants. In this paper we use the results in lfT4l : For 
any unitary matrix [/, the measurement matrix Pg • U with B 
chosen uniformly at random with cardinality m such that: 

m > c'(5“^p^fclog®(n) 

has RIP with probability > 1 — c"n~^ and 52k < 5, where 
p = /r(C/, Id) is the incoherence between U and the identity 
(standard basis). 


B. User detection 


Let us first calculate the propability of not detecting an 
active user PmdiO (’’missed detection”), and falsely detecting 
an inactive user PfaiO (’’false alarm”). Recall that for a given 
pilot power a the channel estimation error is d = h — h with 
ft = Qiiy/'/a). 

Theorem 1. We have for a fixed sampling matrix <i> with RIP- 
constant 4fe-' 


PmdiO < F{0 + 


Cr(0ci(4fc)^mo'^ 

ak2 


PfaiO < 


Cl(4fc)^w 


where Cr{^) is defined below in eqn. (|3). 

Proof: Since the system is symmetric we can consider any 
user u and drop the indices (•)„ in all user-specific variables. 
Furthermore, we also drop ' used to denote Fourier transforms. 
Abbreviate the dependent random variables by x := ||/i|| and 







y := ||d|| and let F(x) be the distribution of x. We have: 


Proof: We have: 


< Pr{|||d|| - ||/i|| < C} < Pr{a; - ^ < y} 

= Pr{x — C < 2/} n {a; < ^} + Pr{a; — ^ < y} r\ {x > ^} 

< Pr{a: < ^} + Pr{x — ^ < 2/} H {x > ^} 

= Pr{a: < ^} + Pr{y^ > {x — fl {cc > ^} 




<m 




where in the last step we use Markov’s inequality and: 


CriO ■ = 


r°° dF{x) 

Ji 


(3) 


and the last step follows since the f 2 -norm function is the sum 
of its squared terms. Hence, its expectation can be calculated 
as the expectation of the sum of partial user contributions. 
These partial user contributions depend only on the marginal 
distributions which are equal for all active users from which 
the result follows. 

The term iiJ(||(i|p) can be calculated as 


E{\\d\\^) = E{\\Q,{y/^)-hf) 

< ) 

^ ci{52kfrncj‘^ 

~ a 

The false alarm probability is calculated in a similar manner. 


For the (ergodic) achievable rates R (a) := E log(l + |/ip) 
per subcarrier with a random channel power \Wf for a given 
pilot/data power split a we can show the following: 

Theorem 2. Let the channel impulse response be k-sparse 
and use eqn. O as the channel estimate. The achievable rate 
R{a) per subcarrier for a particular user is lower bounded 
by: 

R (a) > F^/i|{||/i||>{} [log(l + (1 - a) \h\‘^a~'^)] (1 - Pmd) 

\ an J 

for a fixed sampling ^ obeying a RIP-constant d2k < \/2 — 1. 


Elog {1 + \h + E{d\h)\^+E{\d\^\H)) 

= Elog (1 + E{\h + Eid\h)\^)\h) + E{\df\h)) 

= Elog (1 + E{\h + E{d\h) + 5)nn)) 

= Elog{l + E{\h\^\h)) 

>Elog{l + \h\^) 

where the last line is due to Jenssen’s inequality. ■ 

Corollary 1. Note that Lemma Q] recovers Lemma 2 in m 
Theorem 1] by the fact that ifh is the minimum mean squared 
estimate (MMSE) of h then E{d\h) — 0 so that d = d and h 
and d are actually independent. Hence E{\'i)\'^\h) = i?(|c)p) = 
a and E{\h\‘^) = 1 — a so that: 

Elog{l + \hf) 

< Elog (1 + in + EidM"^ + E{\d\^\n)) 

= Eiogli + \nf + E(\i,f\h)) 

= log (l + (1 — a) + a) 

because h and h have the same distribution. 

proof of Theorem \2]; We have for each subcarrier and 
user (recall that we also dropped user indices (•)„ and the 
"-notation for the Fourier transform): 

y = {h + d)x + e 

where E{\x\'^) = 1 — a and i?(|ep) = a^. We can assume 
that X is independent of the tuple {h, d) but h and d are in 
fact not (they were if we had MMSE estimation). Linking the 
achievable mutual information I to MSB estimation as GSl 
Theorem 1] 

Iix;y\h) = H{x\h) - H{x\h,y) 

> H{x\h) — E ^log[7rei?(|x — aiyf |?i)]^ 

where real a is some free parameter. The optimal aiy is the 
MMSE of X so that in general by the orthogonality principle: 

ai=E{yx*\H)lE{\y\^\h) 

In the following we use cii for bounding the rate but without 
explizit calculating the term. Before we proceed let us rewrite 


To prove this theorem we need the following lemma: 

Lemma 1. Suppose h, h are random variables and h is an 
estimate of h (hence they are correlated). Denote the error as 
d = h — h and its unbiased version d = d — E(d\h). Then we 
have: 

Elog (1 + |/ip) < E;iog (1 + \h+E{d\K)\^ + E{\d\'^\h)) 


y = {h + E{d\fi) + d — E{d\h))x + e 
= {h + E{d\h) + 5))a: + e 

where 

5 = d — E{d\h) 

Hence, we have E{d\h) = 0 so that + E{d\h) and 5 are in 
fact orthogonal which is the desired goal. Using optimal ai 








we have after some tedious algebra: 


R[a) 

> Elog 


= Elog 


= Elog 


f Ei\x\^)\h + E{d\h)\^ + E{\x\^)E{\d\^\h) + 

V Ei\x\^)Ei\d\^\h) + a^ 

( E{\xnh + E{d\h)\^ \ 

V E{\x\^)E{\X)Y\h) + ) 

( {l-a)\h + E{d\h)\^ \ 

V ^ {I-a)E{mh) + a^j 


The capacity loss per subchannel in eqn. (|2]) can be upper- 
bounded by 


Ai? (a) 

, / (1 — a)h 

<Elog[l + ^ - ^ 


- E log 1^1 + 
= i? log (1 + 


{l-a)\h + E{d\h)\^ 
{l-a)E{\d\^\h) + a^ 
(1 — a)h 


- log 1 + 


+ T; log ( 1 + 


(1 -a)\h + Eidlh)]"^ + (1 - a)E{\d\^\h) 


il-a)E{mh) 


which by virtue of Lemma [T] gives: 

Clearly, we have E{\d\'^\h) < E{\d\'^\h) and get so the final 
result using: 

E{\^\'^\h) < 2E{\d\'^\h) < 

an 

We can finally incorporate the missed detection properly. ■ 
We also have the following upper bound. 

Theorem 3. Let the channel impulse response be k-sparse 
and use eqn. ([TJ as the channel estimate. The achievable rate 
per subcarrier is upper bounded by: 


R (a) < i?/i log 1 + 


(1 — a) ha 


1 + 




for a fixed sampling $ obeying a RIP-constant 52k < s/2 — 1. 
Proof: The prove follows from M- ■ 


IV. Simulations 

An LTE-A 4G frame consists of a number of subframes 
with 20MHz bandwidth; the first subframe contains the RACH 
with one ’’big” OFDM symbol of 839 dimensions located 
around the frequency center of the subframe. The FFT size is 
24578=24k corresponding to the 20MFIz bandwidth whereby 
the remainder bandwidth outside PRACH is used for scheduled 
transmission in LTF-A, so-called PUSCH. The prefix of the 
OFDM symbol accommodates delays up to lOOps (or 30km 
cell radius) which equals 3000 dimensions. In the standard 


the RACH is responsible for user aquisition by correlating 
the received signal with preambles from a given set. Here, 
to mimic a 5G situation, we equip the transmitter with 
the capability of sending information in ’’one shot”, i.e., in 
addition to user aquisition, channel estimation is performed 
and the data is detected. For this a fraction of the PUSCH is 
reserved for data packets of users which are detected in the 
PRACH. Please note the rather challenging scenario of only 
839 subcarrier in the measurement window versus almost 24k 
data payload subcarriers. 

In our setting, a limited number of users is detected out 
of a maximum set (here 10 out of 100). We assume that the 
delay spread is below 300 dimensions of which only a set of 
6 pathes are actually relevant. The pilot signalling is similar 
to 0 but modifed to fit the data/pilot separation. Fach active 
user sends 1000 bits in some predefined frequency slot. This is 
uniquely achieved by mapping the sequences to a slot. Hence, 
in the classical Shannon setting 100 users x (300 pathes H- 
1000 bits) = 130k dimensions are needed while there are only 
24k available! The performance results are depicted in Fig. |2] 
where we show show symbol error rates (SFR) over the pilot- 
to-data power ratio a. Moreover, in Fig. [3 we depict false 
detection probability Pfd (some user is detected while not 
active) over missed detection probability Pmd (user is active 
while not detected). We observe that, although the algorithms 
might not yet capture the full potential of this idea, reasonable 
detection performance can be achieved by varying a. In the 4G 
LTF-A standard a minimum Ppo = 10“^ is required for any 
number of receive antennas, for all frame structures and for 
any channel bandwidth. For certain SNRs a minimum Pmd = 
10“^ is required. It can be observed from the simulations that 
the requirements can be achieved. Actually, compared to 4G 
LTF-A where the control signalling can be up to 2000% m 
of a single resource element the control overhead is in the CS 
setting down to to 13% (let alone the huge increase in latency) 
in the best case. 


V. Conclusions 

In this paper, we provided ideas how to enable random 
access for many devices in a massive machine-type scenario. 
In the conceptional approach as well as the actual algorithms 
sparsity of user activity and channel impulse responses plays 
an a pivotal role. We showed that using such framework 
efficient ’’one shot” random access is possible where users 
can send a message without a priori synchronizing with the 
network. Key is a common overloaded control channel which 
is used to jointly detect sparse user activity and sparse channel 
profiles. Such common control channel stands in clear contrast 
to dedicated control signalling per ressource block, and is thus 
more efficent particularly for small ressoure blocks. Since each 
user also has channel state information for all subcarriers, 
there are additional degrees of freedom to place the ressource 
blocks. We analyzed the system theoretically and provided 
a link between achievable rates and standard compressing 
sensing estimates in terms of explicit expressions and scaling 
laws. Finally, we supported our findings with simulations in an 















Fig. 2. Averaged BPSK SER in 5G ”one-shot” random access in a 20MF[z 
LTE-A standard setting at (overall) SNR=20dB. m = 839 out of n = 24576 
dimensions are used for CS and sparsity of the channel is A: = 6. The total 
number of users is 100, out of which 3-20 are active. The control overhead 
is below 13% 



Eig. 3. PpD, Pmd over ^ and fixed a (such SER< 10 for the 5G 
”one-shot” random access 

LTE-A-like setting allowing ’’one shot” sparse random access 
of 100 users in 1ms with good performance. 
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